Regression models, scan statistics and reappearance probabilities to detect regions of association between gene expression and copy number.

نویسندگان

  • Jennifer L Asimit
  • Irene L Andrulis
  • Shelley B Bull
چکیده

Early studies of breast cancer microarray data used linear models to quantify the relationship between measures of gene expression (GE) and copy number (CN) obtained from tumour samples. Motivated by a study of women with axillary node-negative breast cancer, we propose a regression-based scan statistic to identify within-chromosome clusters of genetic probes that exhibit association between GE and CN, while accounting for tumour characteristics known to be prognostic for clinical outcome. As a measure of the association between GE and CN, for each genetic probe available from a microarray we regress GE on CN, and include subject-specific covariates. In the development of the scan statistic, the within-chromosome spatial distribution of the subset of probes with a statistically significant association is approximated by a Poisson process. By incorporating the distance between the probe positions, the scan statistic accounts for the spatial nature of CN alterations. Regions identified as clusters of significant associations are hypothesized to harbour genes involved in breast cancer progression. Using simulations, we examine the sensitivity of the method to certain factors, and to address issues of repeatability, we consider reappearance probabilities for each probe within detected regions and assess the utility of a quantity estimated by bootstrap sample frequencies. Applications of the proposed method to joint analysis of GE and CN in breast tumours, with and without an informative covariate, and comparisons with alternative methods suggest that inclusion of covariates and the use of a regional test statistic can serve to refine regions for further investigation including the analysis of their association with outcome.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

BIRC5 Genomic Copy Number Variation in Early-Onset Breast Cancer

Background: Baculoviral inhibitor of apoptosis repeat-containing 5 (BIRC5) gene is an inhibitor of apoptosis that expresses in human embryonic tissues but it is absent in most healthy adult tissues. The copy number of BIRC5 has been indicated to be highly increased in tumor tissues; however, its association with the age of onset in breast cancer is not well understood. Methods: Forty tumor tiss...

متن کامل

Relationship Between PIK3CA Amplification and P110α and CD34 Tissue Expression as Angiogenesis Markers in Iranian Women with Sporadic Breast Cancer

Background and Objective: The PI3K/AKT/mTOR pathway is known to play an important role in regulating angiogenesis both in normal and breast cancer (BC) tissues. PIK3CA amplification was reported in various malignancies, including approximately 10% of BC cases. The aim of this study was to identify the frequency of PIK3CA amplification in Iranian female patient...

متن کامل

Increased Expression of CYP2E1 Gene in Gastric Cancer May be a Molecular Marker for Mazandaran Province Population

Cytochrome P450 2E1 (CYP2E1) enzyme metabolically activates a large number of low molecular mass xenobiotics probably involved in gastric cancer incidence through activation of procarcinogens. North of Iran is amongst high incidence rate areas of gastric carcinoma where environmental carcinogenic compounds, including agricultural pesticides, are massively used. In this report, we quantitatively...

متن کامل

Prediction of Blasting Cost in Limestone Mines Using Gene Expression Programming Model and Artificial Neural Networks

The use of blasting cost (BC) prediction to achieve optimal fragmentation is necessary in order to control the adverse consequences of blasting such as fly rock, ground vibration, and air blast in open-pit mines. In this research work, BC is predicted through collecting 146 blasting data from six limestone mines in Iran using the artificial neural networks (ANNs), gene expression programming (G...

متن کامل

Assessing Experimental and Intelligent Models in Estimating Reference Evapotranspiration

Introduction: As the most important element in the hydrologic cycle which depends on climate variables such as near-ground wind speed, air temperature, solar radiation, and relative humidity,  reference evapotranspiration (ET0) is normally computed through a variety of methods, each of which requires different and in some cases extensive data that are unavailable in many circumstances, especial...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Statistics in medicine

دوره 30 10  شماره 

صفحات  -

تاریخ انتشار 2011